1. INTRODUCTION

Cancer is among the deadliest of diseases. Unfortunately, most cancer patients are diagnosed in the
final stages of the disease. The revolution brought about by artificial intelligence in the health
sector and other sectors has significantly reduced these risks [1]. Correct diagnosis of diseases is an
arduous and time-consuming task that requires many years of medical training. There is a significant
shortage of diagnostic experts in many medical specialties, and this places tremendous pressure and a
considerable workload on doctors, in addition to the costs and difficulties patients face in reaching
skilled doctors who can diagnose their disease as quickly as possible and perhaps save their lives [2].

Machine learning is a machine's ability to learn through continuous training on the data set (input)
that the algorithm uses, repeating the training process until the desired results are reached. Machine
learning proves particularly useful when diagnostic information is available in digital form, such as
X-rays, computed tomography (CT) scans, magnetic resonance imaging, and skin and eye images. Deep
learning algorithms are characterized by their ability to match patterns in images, so they can be
trained to detect different types of cancers in CT images [3], [4]. Thanks to artificial intelligence,
all people will soon be able to obtain, at low cost and at any time, the same performance as the best
experts in diagnosing diseases from medical images [5].

Skin diseases take many shapes, types, and colors, as they form on the surface of the human skin with
different textures and colors [6]. When digital images of skin diseases are taken, the stored data vary
in texture composition and color contrast. Many researchers have presented papers discussing how to
distinguish different groups of skin diseases [7], [8].

Many deep learning architectures have appeared [9], such as the LeNet architecture [10], [11], which
was used to detect benign and malignant tumors. The AlexNet architecture followed [12], [13] and was
used to distinguish multiple types of skin diseases, and the ZFNet algorithm was developed to deal with
different types of diseases [14], [15]. The more complex VGG16 and VGG19 architectures were built for
classification [16].


2. THE PROPOSED METHOD 

The proposed method seeks to design and develop a computer vision system that segments and classifies
skin lesions and extracts a distinct set of features from them for adequate classification. Digital
images of skin taken with digital cameras pose many challenges because they contain noise such as hair
and bubbles. This noise is a real obstacle to obtaining accurate classification, since the model is
trained on the noise as well.

The digital images were processed using their three color channels (red, green, and blue), and the
image size was set to 250×250 for each of the basal cell carcinoma, melanoma, and nevus classes. The
three channels of the dermatoscopy images are used in each of the convolutional neural networks. The
input images were also normalized by subtracting the mean pixel intensity and dividing by the standard
deviation, so that the normalized image has a mean of zero and a standard deviation of 1. Images
containing hair were also handled, which increases the ability to learn and thus improves the testing
process.
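
The normalization step can be illustrated with a minimal sketch. It assumes the statistics are computed per image over a flat list of pixel intensities; the text does not state whether they are taken per image, per channel, or over the whole data set. The helper name `normalize` and the sample values are hypothetical.

```python
from statistics import fmean, pstdev


def normalize(pixels):
    """Shift pixel intensities to zero mean and scale them to unit standard deviation."""
    mean = fmean(pixels)
    std = pstdev(pixels, mu=mean)  # population standard deviation
    if std == 0:
        # A constant image has no contrast; return zeros instead of dividing by zero.
        return [0.0 for _ in pixels]
    return [(p - mean) / std for p in pixels]


# Toy "image" flattened to a list of intensities (illustrative values only).
normalized = normalize([10, 20, 30, 40])
```

After this step the values have a mean of zero and a standard deviation of one, which is the property the paragraph above describes.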

The images were obtained from the International Skin Imaging Collaboration (ISIC 2020) archive, which
provides a set of images of skin cancer [17]. The ISIC skin diseases project was implemented to reduce
the increasing mortality associated with skin cancers and to improve the efficiency of early detection
of skin cancer. The ISIC data set contains approximately 69,445 images. The training data used in this
research were downloaded from ISIC. The total of 17,846 dermatoscopy images includes 3,008 images of
basal cell carcinoma, 5,272 images of melanoma, and 9,566 images of nevus. Figure 1 shows samples of
these images: Figure 1(a) a sample of basal cell carcinoma, Figure 1(b) a sample of melanoma, and
Figure 1(c) a sample of nevus. The validation data comprise 20% of the images used and are not
classified. The final test data contain 1,500 anonymous images. In the ISIC challenge, ultra-pixel
images and basic patient information, including age and gender, were provided (these two items of
patient information were excluded in both training and testing) [18], [19].


Figure 1. Samples of training images: (a) digital color image of a basal cell carcinoma (BCC) sample,
(b) digital color image of a melanoma sample, and (c) digital color image of a nevus sample


In this paper, two malignant pathologies of the skin were detected by comparison with images of nevus.
Basal cell carcinoma (BCC) is a malignant skin tumor and is considered the most common form of skin
cancer. Basal cells are small cells usually located in the upper part of the skin (the epidermis
layer) [20]. One of the most prominent features of basal cell carcinoma is a red spot or an irritated
area on the chest, shoulder, arms, or legs. It usually appears as a boil, a shiny bump, or a small
bright lump that is transparent and glossy, sometimes pink, red, or white; it can also be yellowish,
black, or brown and may be confused with a nevus [2], [11].

Melanoma is the deadliest form of skin cancer. Changes in the skin, such as a mole or a pimple that
grows in size or changes shape over time, the appearance of an abnormally large mole, and the
appearance of pale or red pimples on the skin, are all signs of melanoma [12], [13]. A natural nevus
has a prominent border that separates the mole from the surrounding skin and usually has a homogeneous
color such as tan, brown, or black. Nevi are normally oval or round, with a diameter of less than a
quarter of an inch (approximately 6 mm), about the size of a pencil eraser [3], [15].

For some complex tasks, it is difficult to obtain the large amounts of data required to train the
models. This problem is addressed by applying different transformations to the available data to
generate new data. Figure 2 shows the data augmentation processes: Figure 2(a) original images,
Figure 2(b) flipped images, Figure 2(c) cropped images, Figure 2(d) images with a Gaussian filter
applied, Figure 2(e) resized images, Figure 2(f) images rotated by 90°, Figure 2(g) images rotated by
180°, Figure 2(h) images rotated by 270°, and Figure 2(i) translated images.
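
A few of these transformations can be sketched on a tiny grayscale image stored as a list of rows. This is only an illustration under stated assumptions: the function names, the clockwise rotation direction, the one-pixel shift, and the zero fill value are hypothetical choices, and cropping, resizing, and Gaussian filtering are omitted for brevity.

```python
def flip_horizontal(img):
    """Mirror each row left to right."""
    return [row[::-1] for row in img]


def rotate90(img):
    """Rotate the image by 90 degrees clockwise."""
    return [list(col) for col in zip(*img[::-1])]


def rotate(img, quarter_turns):
    """Rotate clockwise by a multiple of 90 degrees."""
    out = [list(row) for row in img]
    for _ in range(quarter_turns % 4):
        out = rotate90(out)
    return out


def translate(img, dx, dy, fill=0):
    """Shift the image by (dx, dy) pixels; uncovered pixels take the fill value."""
    h, w = len(img), len(img[0])
    out = [[fill] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            ny, nx = y + dy, x + dx
            if 0 <= ny < h and 0 <= nx < w:
                out[ny][nx] = img[y][x]
    return out


def augment(img):
    """Return a dictionary of augmented variants of one image."""
    return {
        "original": [list(row) for row in img],
        "flipped": flip_horizontal(img),
        "rot90": rotate(img, 1),
        "rot180": rotate(img, 2),
        "rot270": rotate(img, 3),
        "translated": translate(img, dx=1, dy=0),
    }


sample = [[1, 2], [3, 4]]
variants = augment(sample)
```

Each original image yields several training samples in this way, which is how augmentation enlarges a limited data set.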


Figure 2. Data augmentation: (a) original image, (b) flipping, (c) cropping, (d) Gaussian filter,
(e) resizing, (f) rotation by 90°, (g) rotation by 180°, (h) rotation by 270°, and (i) translation


3. ALGORITHM 

A convolutional neural network is a type of artificial neural network that shares the main features of
a cognitive neural network [17]. The convolutional neural network (CNN) architecture uses three key
techniques: weight sharing, local receptive fields, and spatial reduction. This network is somewhat
similar to the biological vision system in that raw data is applied directly as input to the network
without the need for preliminary processing or separate feature extraction. Feature extraction and
identification are performed within one standard structure [19], [20].

Poor lighting conditions can cause images captured in outdoor locations to deteriorate significantly.
Low dynamic range and excessive noise levels in these photos can reduce the overall effectiveness of
computer vision systems. As demonstrated in Figure 3, an enhancement technique was employed to improve
low-light and high-light photos and make the suggested algorithm more robust in both settings:
Figure 3(a) shows the luminous and dark images, Figure 3(b) the inverted images, Figure 3(c) the result
of applying the enhancement algorithm, and Figure 3(d) the inverted enhanced images [21]. The process
begins with inverting the low-light or high-light image, then applying the enhancement algorithm to the
inverted image, and then inverting the enhanced image, as in (1) and (2):


$$O(x) = S(x)\,T(x) + A\,(1 - T(x)) \tag{1}$$
$$J(x) = \frac{O(x) - I_v}{\max(T(x),\, t_0)} + I_v \tag{2}$$


where O is the observed intensity, S denotes the scene radiance, A denotes the ambient light, and T
denotes the portion of the light that reaches the camera. The enhancement process involves estimating
the atmospheric light A using the dark channel prior, estimating the transmission map T, refining the
estimated transmission map, restoring the image, and optionally applying contrast enhancement [22], [23].
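
The invert, enhance, invert sequence can be sketched as follows. This is a simplified illustration, not the exact procedure of [21]-[23]. It makes these assumptions:

- the dark channel is taken per pixel, with no patch-wise minimum and no transmission refinement;
- the Iv term in (2) plays the role of the ambient light A;
- `omega = 0.8` and `t0 = 0.1` are illustrative parameter values;
- pixels are (r, g, b) tuples with values in [0, 1].

```python
def enhance_low_light(pixels, omega=0.8, t0=0.1):
    """Invert the image, restore it with the haze model, and invert it back."""
    # Step 1: invert the low-light image.
    inverted = [tuple(1.0 - c for c in p) for p in pixels]

    # Step 2: per-pixel dark channel (minimum over the color channels).
    dark = [min(p) for p in inverted]

    # Step 3: ambient light A, taken as the color of the pixel with the largest dark channel.
    brightest = max(range(len(dark)), key=dark.__getitem__)
    ambient = tuple(max(c, 1e-6) for c in inverted[brightest])

    restored = []
    for p in inverted:
        # Step 4: transmission estimate T(x), bounded below by t0 as in (2).
        t = 1.0 - omega * min(c / a for c, a in zip(p, ambient))
        t = max(t, t0)
        # Step 5: restore the inverted image, J = (O - A) / max(T, t0) + A.
        j = tuple((c - a) / t + a for c, a in zip(p, ambient))
        # Step 6: invert back and clip to the valid range.
        restored.append(tuple(min(1.0, max(0.0, 1.0 - c)) for c in j))
    return restored


dark_pixels = [(0.1, 0.1, 0.1), (0.2, 0.15, 0.1), (0.05, 0.05, 0.05)]
enhanced = enhance_low_light(dark_pixels)
```

On this toy input, every output channel is at least as bright as the corresponding input channel, which is the intended effect on a low-light image.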
Building on deep learning, an intelligent model for data discovery and classification is being
developed in which the appropriate components can be dragged and dropped to create a suitable model.
The proposed model consists of convolutional layers, activation functions, pooling layers, and fully
connected layers, as in Figure 4. The digital images have three color channels (red, green, and blue),
as this type of digital image gives accurate data. The depth denotes the number of filters used in the
convolution process. If the convolution is executed on the original image using three different
filters, it produces three feature maps for the input image, so the depth of the feature map is
three [24], [25]. The stride is the number of pixels by which the filter matrix is moved over the input
image matrix. When the stride is 1, the filter moves only one pixel at a time; a larger stride produces
smaller feature maps [26].
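
The effect of depth and stride on the feature-map size can be sketched with the standard output-size formula for a convolution. The 3×3 kernel, the zero padding, and the function name `conv_output_shape` are illustrative assumptions; the text does not specify the filter sizes used.

```python
def conv_output_shape(height, width, kernel, stride=1, padding=0, filters=1):
    """Return (out_height, out_width, depth) of a convolutional layer's output."""
    out_h = (height + 2 * padding - kernel) // stride + 1
    out_w = (width + 2 * padding - kernel) // stride + 1
    # The depth of the output equals the number of filters applied.
    return out_h, out_w, filters


# A 250x250 input with three 3x3 filters: stride 1 versus stride 2.
shape_stride1 = conv_output_shape(250, 250, kernel=3, stride=1, filters=3)
shape_stride2 = conv_output_shape(250, 250, kernel=3, stride=2, filters=3)
```

With these assumed values the outputs are 248×248×3 for stride 1 and 124×124×3 for stride 2, so three filters give a depth of three and the larger stride gives the smaller feature maps.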

The pooling process progressively reduces the spatial size of its inputs. It represents the inputs as
a feature matrix with smaller dimensions, which is easier to manage, reduces the number of parameters
and computations in the network, and thus helps control overfitting [27], [28]. Pooling also makes the
network stable and resistant to minor changes, distortions, and shifts in the input image. It also
helps in reaching a nearly invariant representation of the image, a powerful feature because it helps
identify the objects in the image regardless of where they are [29], [30]. Adding a fully connected
layer is an inexpensive way to learn non-linear combinations of these features. Most of the features
extracted by the convolution and pooling layers may be suitable for classification, but combinations
of these features may be even better. The probabilities output by the fully connected layer sum to
one.
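
A minimal sketch of these two properties follows. It assumes 2×2 max pooling with stride 2 and a softmax on the final layer; the text names neither choice, and both are common defaults rather than confirmed details of the proposed model.

```python
import math


def max_pool_2x2(feature_map):
    """Downsample a 2-D feature map by taking the maximum of each 2x2 block."""
    h = len(feature_map) // 2 * 2
    w = len(feature_map[0]) // 2 * 2
    return [
        [
            max(feature_map[y][x], feature_map[y][x + 1],
                feature_map[y + 1][x], feature_map[y + 1][x + 1])
            for x in range(0, w, 2)
        ]
        for y in range(0, h, 2)
    ]


def softmax(scores):
    """Convert raw class scores to probabilities that sum to one."""
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


pooled = max_pool_2x2([[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]])
# A small shift of a feature inside one pooling window does not change the pooled output.
shifted_a = max_pool_2x2([[9, 0], [0, 0]])
shifted_b = max_pool_2x2([[0, 0], [0, 9]])
probabilities = softmax([2.0, 1.0, 0.1])
```

The 4×4 map shrinks to 2×2, the two shifted inputs pool to the same value, and the softmax outputs sum to one.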


Figure 3. Enhancing dark and bright images: (a) luminous and dark original images, (b) inverted images,
(c) images after applying the enhancement algorithm, and (d) inverted enhanced images


Figure 4. The proposed architecture of the training process


The design and construction of the convolutional neural network architecture rest on two essential
bases for handling digital images: the different textures of the images and the various colors of the
entities within them. In the proposed architecture, the textures of the skin disease images are handled
in the convolution stage. Figure 5 shows images of a skin disease that contain hair, where
Figures 5(a)-(c) illustrate texture analysis of skin disease images and Figures 5(d)-(f) illustrate the
various colors of skin diseases. When convolution is applied, the textures of the disease and the
textures of the hair are distinguished, so both disease-specific features and features of the
surrounding noise are obtained. The diversity of colors in skin diseases is a distinct advantage:
colors are collected and treated as distinctive characteristics that are added, through the pooling
operations, to the features used in the classification process.


Figure 5. Diversity of shapes and colors of skin diseases: (a), (b), (c) texture analysis of skin
disease images, and (d), (e), (f) various colors of skin diseases


4. RESEARCH METHOD

The convolution and pooling layers act as feature extractors for the input image, and the fully
connected layer works as a classifier for the input image using the extracted features, as in Figure 6.
The target probability is one for the correct class and zero for the remaining classes. The
convolutional network training process can be summarized in the following steps:

i) Initialize the filters to be used and set the parameters and weights to random values.
ii) Feed the training images into the network and find the output probability for each class through
forward propagation (convolution operations, ReLU, pooling, and forward propagation through the fully
connected layer).
- Assume the output probabilities for the image are [0.3, 0.5, 0.2, 0.3].
- The resulting probabilities for the first training sample will be random because the initial weights
are random.
iii) Calculate the total error in the output layer, as in (3):

$$\text{Total Error} = \sum \tfrac{1}{2}\,(X - Y)^2 \tag{3}$$

where X = target probability, Y = output probability.
iv) Use the backpropagation algorithm to calculate the gradient of the error with respect to the
weights of the network.
- The weights are updated according to their contribution to the total error.
- The output probability of [0.2, 0.2, 0.8, 0.2] is nearer to the target vector [0, 0, 1, 0] for the
same input image. This indicates that the network can classify this image correctly once the filters
and weights have been adjusted.
- Only the filter matrix values and the weights are updated; the number of filters, filter size,
network structure, and other parameters remain fixed.
v) Repeat steps ii) through iv) for all images in the training data.
vi) When a new (unseen) image is used as an input to the network, the network performs the forward
propagation process again and outputs the probability of each class (for the new image, the output
probabilities are computed with the weights optimized on the previous training samples). If the
training dataset is large enough, the network will generalize to new images and categorize them into
the correct categories.
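
The error computation in (3) can be worked through with the probability vectors quoted in the steps above. The function names are hypothetical, and the gradient shown is only the derivative of (3) with respect to the outputs, not a full backpropagation pass.

```python
def total_error(target, output):
    """Total error as in (3): the sum over classes of one half of (X - Y) squared."""
    return sum(0.5 * (x - y) ** 2 for x, y in zip(target, output))


def error_gradient(target, output):
    """Derivative of the total error with respect to each output probability: Y - X."""
    return [y - x for x, y in zip(target, output)]


target = [0, 0, 1, 0]
initial_output = [0.3, 0.5, 0.2, 0.3]   # output with random initial weights
adjusted_output = [0.2, 0.2, 0.8, 0.2]  # output after the weights are adjusted

error_before = total_error(target, initial_output)
error_after = total_error(target, adjusted_output)
gradient_after = error_gradient(target, adjusted_output)
```

For these vectors the total error is about 0.535 before the update and about 0.08 after it, which quantifies the statement that the second output is nearer to the target.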


Figure 6. Steps to implement the proposed convolutional neural network


5. RESULTS AND DISCUSSION 

Various skin disease image samples were used; Figure 7 shows images of BCC taken in different shapes
and colors. Two malignant skin diseases were detected, each compared separately with nevus images. The
detection process showed that the classification accuracy for BCC was higher than that for melanoma,
as shown in Figure 8, where Figure 8(a) shows BCC detection and Figure 8(b) shows melanoma detection.
The same images of normal nevus were used for both diseases, as shown in Tables 1 and 2. The essential
feature of the proposed architecture is that it reaches high classification accuracy during training.
Figure 8 shows that training ran for 50 epochs, while the required accuracy was reached before that.
The proposed architecture has also been adapted to deal with images that vary in shape, color, and the
method used to capture the image of the disease.


Figure 7. Various basal cell carcinomas


Figure 8. Results of training and verification processes: (a) BCC detection and (b) melanoma detection


Table 1. Accuracy for detecting BCC disease

Architecture         Layers  Conv. layers  FC layers  Samples (ISIC archive 2020)  Time for 50 epochs  Accuracy
...
VGG-16               16      13            ...        ...                          ...                 ...
Architecture of CNN  8       4             ...        3008                         GPU = ... min       98.25 %


Table 2. Accuracy for detecting melanoma disease

Architecture         Layers  Conv. layers  FC layers  Samples (ISIC archive 2020)  Time for 50 epochs  Accuracy
LeNet-5              5       2             ...        ...                          GPU = 9 min         88.51 %
AlexNet              8       5             ...        ...                          GPU = ... min       81.16 %
ZFNet                8       5             ...        ...                          GPU = 23 min        64.73 %
VGG-16               16      13            ...        ...                          GPU = ... min       63.94 %
Architecture of CNN  8       ...           ...        5272                         GPU = ... min       22-61 %


6. CONCLUSION 

The accuracy rate in differentiation processes varies according to the data used and the similarity
between those data. The results showed that the accuracy rate in detecting BCC was higher than that in
detecting melanoma; however, the accuracy in detecting both diseases was high. The proposed
architecture used in this research also showed high efficiency in detecting malignant diseases. Its
accuracy increases as the number of training steps (epochs) increases, unlike some other
architectures, which reach a constant accuracy that does not improve with more epochs. This feature
increased the efficiency of the testing process on the randomly selected test images by obtaining high
accuracy in the discrimination process. The proposed project is characterized by exemplary performance
and high resolution for many of the images that were dealt with, despite the significant differences
between one image and another in the shapes and colors of these diseases. The AI architecture also
gave typical results with many input images, as the images were taken from the ISIC 2020 archive.


A proposed architecture for convolutional neural networks to detect skin cancers (Hasan Maher Ahmed)
ISSN: 2252-8938